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Abstract 

An edge-colored graph G is rainbow connected if any two vertices are connected by a path 
whose edges have distinct colors. The rainbow connection of a connected graph G, denoted 
rc(G), is the smallest number of colors that are needed in order to make G rainbow connected. 
In the first result of this paper we prove that computing rc{G) is NP-Hard solving an open 
problem from [6j. In fact, we prove that it is already NP-Complete to decide if rc(G) = 2, and 
also that it is NP-Complete to decide whether a given edge-colored (with an unbounded number 
of colors) graph is rainbow connected. On the positive side, we prove that for every e > 0, 
a connected graph with minimum degree at least en has bounded rainbow connection, where 
the bound depends only on e, and a corresponding coloring can be constructed in polynomial 
time. Additional non-trivial upper bounds, as well as open problems and conjectures are also 
presented. 

1 Introduction 

Connectivity is perhaps the most fundamental graph-theoretic property, both in the combinatorial 
sense and the algorithmic sense. There are many ways to strengthen the connectivity property, 
such as requiring hamiltonicity, fe-connectivity, imposing bounds on the diameter, requiring the 
existence of edge-disjoint spanning trees, and so on. 

An interesting way to quantitavely strengthen the connectivity requirement was recently intro- 
duced by Chartrand et al. in [5]. An edge-colored graph G is rainbow connected if any two vertices 
are connected by a path whose edges have distinct colors. Clearly, if a graph is rainbow connected, 
then it is also connected. Conversely, any connected graph has a trivial edge coloring that makes 
it rainbow connected; just color each edge with a distinct color. Thus, one can properly define the 
rainbow connection of a connected graph G, denoted rc(G), as the smallest number of colors that 
are needed in order to make G rainbow connected. An easy observation is that if G is connected 
and has n vertices then rc{G) < n — 1, since one may color the edges of a given spanning tree with 
distinct colors. We note also the trivial fact that rc{G) = 1 if and only if G is a clique, the (almost) 
trivial fact that rc{G) = n — 1 if and only if G is a tree, and the easy observation that a cycle with 
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A; > 3 vertices has rainbow connection \k/2\. Also notice that, clearly, rc{G) > diam(G) where 
diam(G) denotes the diameter of G. 

Chartrand et al. computed the rainbow connection of several graph classes including complete 
multipartite graphs [5]. Caro et al. [6] considered the extremal graph-theoretic aspects of rainbow 
connection. They proved that if G is a connected graph with n vertices and with minimum degree 
3 then rc{G) < 5n/6, and if the minimum degree is 6 then rc{G) < ^^n(l + f{6)) where f{5) tends 
to zero as 6 increases. They also determine the threshold function for a random graph G{n,p{n)) 
to have rc{G) = 2. In their paper, they conjecture that computing rc{G) is an NP-Hard problem, 
as well as conjecture that even deciding whether a graph has rc{G) = 2 in NP-Complete. 

In this paper we address the computational aspects of rainbow connection. Our first set of 
results solve, and extend, the complexity conjectures from [6]. Indeed, it turns out that deciding 
whether rc{G) = 2 is an NP-Complete problem. Our proof is by a series of reductions, where 
on the way it is shown that 2-rainbow-colorability is computationally equivalent to the seemingly 
harder question of deciding the existence of a 2-edge-coloring that is required to rainbow-connect 
only vertex pairs from a prescribed set. 

Theorem 1.1 Given a graph G, deciding if rc{G) = 2 is NP-Complete. In particular, computing 
rc{G) is NP-Hard. 

Suppose we are given an edge coloring of the graph. Is it then easier to verify whether the colored 
graph is rainbow connected? Clearly, if the number of colors in constant then this problem becomes 
easy. However, if the coloring is arbitrary, the problem becomes NP-Complete: 

Theorem 1.2 The following problem is NP-Complete: Given an edge-colored graph G, check 
whether the given coloring makes G rainbow connected. 

For the proof of Theorem 11.21 we first show that the s — t version of the problem is NP-Complete. 
That is, given two vertices s and t of an edge-colored graph, decide whether there is a rainbow path 
connecting them. 

We now turn to positive algorithmic results. Our main positive result is that connected n-vertex 
graphs with minimum degree 0(n) have bounded rainbow connection. More formally, we prove: 

Theorem 1.3 For every e > there is a constant C = C(e) such that if G is a connected graph with 
n vertices and minimum degree at least en, then rc{G) < C. Furthermore, there is a polynomial 
time algorithm that constructs a corresponding coloring for a fixed e. 

The proof of Theorem 11.31 is based upon a modified degree-form version of Szemeredi's Regularity 
Lemma that we prove and that may be useful in other applications. From our algorithm it is 
also not hard to find a probabilistic polynomial time algorithm for finding this coloring with high 
probability (using on the way the algorithmic version of the Regularity Lemma from [1] or [7]). 

We note that connected graphs with minimum degree en have bounded diameter, but the latter 
property by itself does not guarantee bounded rainbow connection. As an extreme example, a star 
with n vertices has diameter 2 but its rainbow connection is n — 1. The following theorem asserts 
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however that having diameter 2 and only logarithmic minimum degree suffices to guarantee rainbow 
connection 3. 

Theorem 1.4 If G is an n-vertex graph with diameter 2 and minimum degree at least 81ogn then 
rc{G) < 3. Furthermore, such a coloring is given with high probability by a uniformly random 
3-edge-coloring of the graph G, and can also be found by a polynomial time deterministic algorithm. 

Since a graph with minimum degree n/2 is connected and has diameter 2, we have as an immediate 
corollary: 

Corollary 1.5 If G is an n-vertex graph with minimum degree at least n/2 then rc{G) < 3. 

The rest of this paper is organized as follows. The next section contains the hardness results, 
including the proofs of Theorem 1 1 . 1 1 and Theorem ll.2[ Section [3] contains the proof of Theorem 11.31 
and the proof of Theorem ll.4[ At the end of the proof of each of the above theorems we explain how 
the algorithm can be derived - this mostly consists of using the conditional expectation method to 
derandomize the probabilistic parts of the proofs. The final Section H] contains some open problems 
and conjectures. 

2 Hardness results 

We first give an outline of our proof of Theorem 11.11 We begin by showing the computational 
equivalence of the problem of rainbow connection 2, that asks for a red-blue edge coloring in which 
all vertex pairs have a rainbow path connecting them, to the problem of subset rainbow connection 
2, asking for a red-blue coloring in which every pair of vertices in a given subset of pairs has a 
rainbow path connecting them. This is proved in Lemma |2. II below. 

In the second step, we reduce the problem of extending to rainbow connection 2, asking whether 
a given partial red-blue coloring can be completed to a obtain a rainbow connected graph, to the 
subset rainbow connection 2 problem. This is proved in Lemma 12.21 below. 

Finally, the proof of Theorem 11.11 is completed by reducing 3-SAT to the problem of extending 
to rainbow connection 2. 

Lemma 2.1 The following problems are polynomially equivalent: 

1. Given a graph G decide whether rc{G) = 2. 

2. Given a graph G and a set of pairs P Q V{G) x V{G), decide whether there is an edge coloring 
of G with 2 colors such that all pairs (n, f ) G P are rainbow connected. 

Proof. It is enough to describe a reduction from Problem 2 to Problem 1. Given a graph G = 
(y, E) and a set of pairs P C y x we construct a graph G' = {V' , E') as follows. 

For every vertex v we introduce a new vertex x^, and for every pair (n, ^ {V xV)\P 
we introduce a new vertex . We set 

y = y U {x, : e y} U : {u,v) e{VxV)\P} 
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and 

E' = £;u{{i;,x„} : v € ^}u{{u, {t;, } : {u,v) G {VxV)\pju{{x,x'} :x,x' e F^}. 

It remains to verify that G' is 2-rainbow connected if and only if there is an edge coloring of 
G with 2 colors such that all pairs (u, v) e P are rainbow connected. In one direction, notice that 
when G is considered as a subgraph of G', no pair of vertices of G that appear in P has a path of 
length two in G' that is not fully contained in G. Hence, if G is not colorable in a way connecting 
the pairs in P, the graph G' is not 2-rainbow-connccted. 

In the other direction, assume that x '■ E {red, blue} is a coloring of G that rainbow-connects 
the pairs in P. To extend it to a rainbow-coloring x' '■ E' ^ {red, blue}, define x'ii'^^^v}) = blue 
for all V €V, x' ^{u,v)}) = blue and x'i{iJjX{u,v)}) = red for all {u,v) P (note that we treat 
P as a set of ordered pairs - an unordered pair can be represented by putting both orderings in 
P), and finally x' {{^i ^'}) = I'cd for all x,x' € V' \ V. One can see that x' is indeed a valid 
2-rainbow-coloring of G' , concluding the proof. ■ 

Lemma 2.2 The first problem defined below is polynomially reducible to the second one: 

1. Given a graph G = {V,E) and a partial 2-edge-coloring x : E —>■ {0,1} for E d E, decide 
whether x can be extended to a complete 2 edge-coloring % : — > {0, 1} that makes G rainbow 
connected. 

2. Given a graph G and a set of pairs P C V{G) x V{G) decide whether there is an edge coloring 
of G with 2 colors such that all pairs {u, v) E P are rainbow connected. 

Proof. Since the identity of the colors does not matter, it is more convenient that instead of a 
coloring x '■ E —>■ {0,1} we consider the corresponding partition = {Ei,E2) of E. Similarly, 
in the case of a partial coloring Xi the pair tt^ = {E\,E2) will contain the corresponding disjoint 
subsets of E (which may not cover E). 

Now, given such a partial coloring x extend the original graph G = (V, E) to a graph 
G' = iy' , E'), and define a set P of pairs of vertices such that for the resulting graph the answer 
for Problem 2 is "yes" if and only if the answer for Problem 1 for the original graph is "yes" . 

Let i : V — > \\V\\ be arbitrary linear ordering of the vertices, and let high : E ^ V he. a 
mapping that maps an edge e = {u,v} to u if i{u) > £{v), and to v otherwise. Similarly, let 
low : ^ F be a mapping that maps an edge e = {u, v} to u if £{u) < i{v), and to v otherwise. 

We construct G' as follows. We add 3 + |^i| + |^2| new vertices 

{6i,c,62}U{ce : e G (EiUiz)} 

and add the edges 

{{6i,c},{c,62}} U {{6i,Ce} : i G {1,2}, e e ^i} U {{ce,low(e)} : e G (A UE^s)}- 
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Now we define the set P of pairs of vertices that have to be 2-rainbow connected: 
P = {{h,b2}} U {{u,v} ■.u,v£v}u {{c,Ce} : e G (^i U^s)} 

u{{6i, low(e)} : i G {1, 2}, e G i^} U {{c^, high(e)} : e G (^i U ^2)}- 

Now, if there is a 2-rainbow-coloring vr^ = (Ei, E2) of G which extends vr-j^ = (E'l, £"2), then we 
color G' as fohows. E is colored as mandated by Xi that is, an edge is colored red if it is in Ei 
and otherwise it is colored blue. Now {61, c}, {62; Ce} for e £ E2 and {cg, low(e)} for e G -Bi are all 
colored blue, and {62, c}, {61, Ce} for e € Ei and {ce,low(e)} for e £ E2 are all colored red. One 
can see that this coloring indeed rainbow-connects all the pairs in P. 

On the other hand, any 2-edge-coloring of G' that connects the pairs in P clearly contains a 
2-rainbow-coloring of G, because P contains all vertex pairs of G when considered as an induced 
subgraph of G' , and also G' contains no path of length 2 between vertices of G that is not contained 
in G. Also, such a coloring would have to color {c, 61} and {0,^2} differently. It would also have 
to color every {6j,Ce} in a color different from that of {c,bi}, and would hence color {ce,low(e)} 
in a color identical to that of {c,bi}, because it has to be the color different from that of {6j,Ce}. 
Finally, every e £ Ei would have to be colored with the color different from that of {c,bi} so 
as to rainbow-connect high(e) and Cg. This means that the coloring of G' not only provides a 
2-rainbow-coloring of G, but that it also conforms to the original partial coloring x. ■ 

Proof of Theorem 11.11 We show that Problem 1 of Lemma 12.21 is NP-hard, and then deduce 
that 2-rainbow-colorability is NP-Complete by applying Lemma [2.1l and Lemma [2. 21 while observing 
that it clearly belongs to NP. 

We reduce 3-SAT to Problem 1 of Lemma [2. 2 [ Given a 3CNF formula cj) = Cj over variables 
xi,X2, . . . , Xn, we construct a graph G^ and a partial 2-edge coloring x' '■ E{G(j,) — > {0, 1} such that 
there is an extension x of x' that makes rainbow connected if and only if (j) is satisfiable. 

We define G^ as follows: 

V{G^) = {q : i G [m]} U {xi : i e [n]} U {a} 

E{G^) = |{ci,Xj} : Xj G Ci in </)|u|{xi,a} : i G [n]|u|{Q, Cj} : i,j G [m]|u|{xi, Xj} : i,j G [n]| 
and we define the partial coloring x' as follows: 

Vj,jeHX'({cj,Cj}) = 

'^i,je[n\x'{{xi,Xj}) = 
V{a;i,Cj}eE(G^)X'({2;i,Cj}) = if is positive in Cj, 1 otherwise 

while all the edges in |{xj,a} : i G [n]| (and only they) are left uncolored. 

Assuming without loss of generality that all variables in (j) appear both as positive and as 
negative, one can verify that a 2-rainbow-coloring of the uncolored edges corresponds to a satisfying 
assignment of (j) and vice versa. ■ 

The proof of Theorem 11.21 is based upon the proof of the following theorem. 
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Theorem 2.3 The following problem is NP-complete: Given an edge colored graph G and two 
vertices s,t of G, decide whether there is a rainbow path connecting s and t. 



Proof. Clearly the problem is in NP. We prove that it is NP-Complete by reducing 3-SAT to it. 
Given a 3CNF formula (f) = AI!Li over variables xi, X2, • • • , x„, we construct a graph G^ with 
two special vertices s,t and a coloring x '■ E^G^f,) [|E'(G^)|] such that there is a rainbow path 
connecting s and t in G^ if and only if (j) is satisfiable. 

We start by constructing an auxiliary graph G' from (p. The graph G' has 3m + 2 vertices, that 
are partitioned into m + 2 layers Vq,Vi, . . . , Vm, Vm+i, where Vq = {s}, Vm+i = {t} and for each 
i G [m], the layer Vi contains the three vertices corresponding to the literals of q (a clause in (p). 
The edges of G' connect between all pairs of vertices residing in consecutive layers. Formally, 



Intuitively, in our final colored graph G^, every rainbow path from s to t will define a satisfying 
assignment of in a way that for every i £ [m], if the rainbow path contains a vertex v & Vi then 
the literal of Cj that corresponds to v is satisfied, and hence Cj is satisfied. Since any path from s to 
t must contain at least one vertex from every layer Vi, this will yield a satisfying assignment for the 
whole formula (p. But we need to make sure that there are no contradictions in this assignment, 
that is, no opposite literals are satisfied together. For this we modify G' by replacing each literal- 
vertex with a gadget, and we define an edge coloring for which rainbow paths yield only consistent 
assignments. 

For every variable Xj , j G [n] , let Vj-^ ,Vj^, . . . , Vj^ be the vertices of G' corresponding to the 
positive literal Xj, and let Vj^,Vj^, . . . ,Vj^ be the vertices corresponding to the negative literal Xj. 
We can assume without loss of generality that both A: > 1 and i > 1, since otherwise the formula (j) 
can be simplified. For every such variable Xj we also introduce k x i distinct colors a\ i, . . . , al. ^. 
Next, we transform the auxiliary graph G' into the final graph G^. 

For every a G [k] we replace the vertex vj^ that resides in layer (say) Vi with £ + 1 new vertices 
vi,V2, ■ ■ ■ , that form a path in that order. We also connect all vertices in Vi-i to vi and connect 
all vertices in Vi+i to t'^+i. For every 5 G [£], we color the edge {vj,, vt+i} in the new path with the 
color a;^ j^. Similarly, for every b G [£] we replace the vertex vj^ from layer (say) Vi' with k + 1 new 
vertices vi,V2, ■ ■ ■ ,Vk^i that form a path, and connect all vertices in Vi'^i to Ui and all vertices 
in Vi'+i to Vk+i- For every a G [k], we color the edge {va,Va+i} with a;^^. All other edges of G^ 
(which were the original edges of G') are colored with fresh distinct colors. 

Clearly, any path from s to i in G^ must contain at least one of the newly built paths in each 
layer. On the other hand, it is not hard to verify that any two paths of opposite literals of the same 
variable have edges sharing the same color. ■ 

Proof of Theorem II. 2L We reduce from the problem in Theorem 12.31 Given an edge colored 
graph G = {V, E) with two special vertices s and t, we construct a graph G' = (y' , E') and define 
a coloring x' ■ E' ^ of its edges such that s and t are rainbow connected in G if and only if 
the coloring of G' makes G' rainbow connected. 




{n, : 3i G [m + 1] s.t. u G V-i and v G Vi 
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Let V = {vi = s,V2, ■ ■ ■ ,Vn = t} be the vertices of the original graph G. We set 
V = VU{s',t',b}U{s\vlvl...,vl,^^,vl_^,t^} 

and 

E' = Eu\^{s',s},{t',t},{s,s'},{t,t^}^u\^{b,Vi}:ie [n]}u 

u\^{v^,v{} -.iein], i G {1,2}} u{K,?;^^ : i,j G [n], 0,6 G {1,2}}. 
The coloring x' is defined as follows: 

• all edges e € E retain the original color, that is x'{^) = x(e); 

• the edges {t,t'}, {s,b} and |{?^i,f|} : i G [n — 1]} are colored with a special color ci; 

• the edges {s,s'},{t,b} and |{t'j,t'j^} : i G [2,n]} are colored with a special color C2; 

• the edges in ^{vi, 6} : i G [2, n — 1]} are colored with a special color C3; 

• the edges in |{vf,w^} : i,j G [n], a,b {1,2}} are colored with a special color C4. 

One can verify that x' makes G' rainbow connected if and only if there was a rainbow path from s 
to t in G. m 

3 Upper bounds and algorithms 

The proof of our main Theorem 11.31 is based upon a modified degree-form version of Szemeredi's 
Regularity Lemma, that we prove here and that may be useful in other applications. We begin by 
introducing the Regularity Lemma and the already known degree-form version of it. 

3.1 Regularity Lemma 

The Regularity Lemma of Szemeredi [9] is one of the most important results in graph theory and 
combinatorics, as it guarantees that every graph has an e-approximation of constant descriptive 
size, namely a size that depends only on e and not on the size of the graph. This approximation 
"breaks" the graph into a constant number of pseudo-random bipartite graphs. This is very useful in 
many applications since dealing with random-like graphs is much easier than dealing with arbitrary 
graphs. In particular, as we shall see, the Regularity Lemma allows us to prove that graphs with 
linear minimum degree have bounded rainbow connection. 

We first state the lemma. For two nonempty disjoint vertex sets A and S of a graph G, we 
define E{A, B) to be the set of edges of G between A and B. The edge density of the pair is defined 
hy d{A,B) = \E{A,B)\/{\A\\B\). 
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Definition 3.1 (e-regular pair) A pair {A,B) is e-regular if for every A' <Z A and B' B 

satisfying \A'\ > e\A\ and \B'\ > e\B\, we have \d{A',B') - d{A,B)\ < e. 

An e-regular pair can be thought of as a pseudo-random bipartite graph in the sense that it 
behaves almost as we would expect from a random bipartite graph of the same density. Intuitively, 
in a random bipartite graph with edge density d, all large enough sub-pairs should have similar 
densities. 

A partition Vi, . . . ,Vk of the vertex set of a graph is called an equipartition if |Vi| and \ Vj \ differ 
by no more than 1 for all 1 < i < j < (so in particular every Vi has one of two possible sizes). 
The order of an equipartition denotes the number of partition classes {k above). An equipartition 
Vi, . . . ,Vk of the vertex set of a graph is called e-regular if all but at most e(^) of the pairs {Vi, Vj) 
are e-regular. Szemeredi's Regularity Lemma can be formulated as follows. 

Lemma 3.2 (Regularity Lemma [9j) For every e > and positive integer K , there exists N = 
A j^g^ e, K), such that any graph with n > N vertices has an e-regular equipartition of order k, 
where K < k < N . 

As mentioned earlier, the following variation of the lemma comes useful in our context. 

Lemma 3.3 (Regularity Lemma - degree form [8j) For every e > and positive integer K 
there is N = N^j^ e, K) such that given any graph G = (V, E) with n > N vertices, there is a 
partition of the vertex-set V into k + 1 sets Vq, V{, . . . , V^, and there is a subgraph G' of G with the 
following properties: 

1. K <k<N, 

2. s = \Vq \ < e^n and all other components V^ , i G [k] are of size £ = ^^y^, 

3. for all i € [k], V- induces an independent set in G' , 

4- for all i,j G [k], the pair {V-,Vj) is e^ -regular in G' , with density either or at least |, 
5. for all V ^V , degQi{v) > degg(f) — |n. 

This form of the lemma (see e.g. [8j) can be obtained by applying the original Regularity 
Lemma (with a smaller value of e), and then "cleaning" the resulting partition. Namely, adding 
to the exceptional set Vq all components Vi incident to many irregular pairs, deleting all edges 
between any other pairs of clusters that either do not form an e-regular pair or they do but with 
density less than e, and finally adding to Vq also vertices whose degree decreased too much by this 
deletion of edges. 
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3.2 A modified degree form version of the Regularity Lemma 

In order to prove that graphs with hnear minimum degree have bounded rainbow connection, we 
need a special version of the Regularity Lemma, which is stated next. 

Lemma 3.4 (Regularity Lemma - new version) For every e > and positive integer K there 
is N = A p^^ e, K) so that the following holds: If G = {V, E) is a graph with n > N vertices and 
minimum degree at least en then there is a subgraph G" of G, and a partition ofV into Vj", . . . , F^" 
with the following properties: 

1. K <k<N, 

2. for all i G [k], (1 - e)f < \V/'\ < (1 + e^)^, 

3. for all i G [k], V-' induces an independent set in G" , 

4- for all i,j G [k], {V-', Vj') is an e^-regular pair in G" , with density either or at least j^, 

5. for all i G [k] and every v G V" there is at least one other class V" so that the number of 
neighbors of v in G" belonging to is at least 1 1 \ . 

Proof. Given e > and K, let us apply the degree- form Regularity Lemma (Lemma I3.3p . with 
the parameters e and K, and let = A ^^ e, K). Let Vq, V/, . . . , F^' be the partition promised by 
Lemma 13.31 let G' be the corresponding subgraph of G. Recall the parameters s < e^n and 
i = defined in Lemma l3.3i 

Consider a mapping / : VJ{ — > [k] where f{v) = i implies that v has more than neighbors 
in G' that belong to V- . Such a mapping clearly exists since otherwise the degree of f in G would 
have been at most (s — 1 + k^e£) + |n < en. 

Next consider the unique mapping g : [k] ^ V^k]) that maps an index i G [A;] to the subset 
of indices g{i) C [k] such that j G g{i) if and only if d(y^,Vj) > e/4 in G'. We notice that the 
cardinality of g{i) is not too small. Indeed, if v is any vertex of V- then the cardinality of g{i) is at 
least 

{degG,{v) - s) e 
i "^2 

Now consider the following process that creates G" from G' by placing the vertices of Vq one- 
by-one in the other vertex classes. Assume that Vq = {vi, . . . ,Vs} and denote by V^* the extension 
of Vi after placing vt somewhere. Initially, = V/ for i = 1, . . . , k, and eventually Vf = ¥(' for 
i = 1, . . . , /c forming the desired partition. 

At stage t, we want to place vt G Vq somewhere. Consider the subset of indices g{f{vt)). We 
will choose j G g{f{vt)) so that Vj""^ has the smallest cardinality (if there are several candidates, 
we may choose any of them). We define Vj = Vj~^ U {vt} and define = V^~^ for i ^ j. The 
neighbors of vt that are kept in G" are all the neighbors of vt in G' that belong to Vx, where 
X G g{j)- Notice that in particular f{vt) G g{j)- 
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Having created G" it remains to prove its claimed properties. Clearly, properties 1 and 3 hold. 
For property 2, notice that the final sets Vl' = have grown from the initial Vl by no more than 

< —e'n = 2e^ 



and hence 



(ffe) ek k 



< = \Vf\<£ + 2e^j; < (1 + e^)^ 



and property 2 holds. 

For property 4, Let us first prove the requirement on the density. Notice that if the original 
density of {V-,Vj) in G' was 0, it remained so also in G" . Otherwise, if the density of {V/,Vj) in 
G' was at least e/4, the fact that \V^'\ < \V{\ + 2e^^ < 2|F/| imphes that the density of (F/', Vf) is 
larger than ^e, no matter where the neighbors of the newly added vertices reside. 

As for the regularity condition in property 4, notice that if (V^', Vj) had density in G' , then the 
same holds in G" and the pair {V- , Vj) is trivially e^-regular. Now, if (V^', Vj) had density S > e/4 
in G' , there are precisely Si'^ edges between them. In G" the number of edges between them can 
increase by at most 

hence the density may have increased from 6 to at most 5 + 4e^. Since 6 > e/4: and since the pair 
{V/,Vj) was initially e^-regular, it follows that the final pair {V-',Vj') is at least e^-regular. 

For property 5, this is easily checked to hold for vertices not originally from Vq because of 
property 5 of G'. Property 5 also holds for vertices of Vq since when is placed in some Vj, we 
know that it has at least neighbors in V- where i = f{vt). ■ We note that the above a partition 
as guaranteed by our modified version of the Regularity Lemma can be found in polynomial time 
for a fixed e (with somewhat worse constants) , by using the exact same methods that were used in 
[I] for constructing an algorithmic version of the original Regularity Lemma. 

3.3 Proof of Theorem fTSl 

In this section we use our version of the Regularity Lemma to prove Theorem 11.31 First we need 
some definitions. Given a graph G = {V,E) and two subsets 14, V2 C V, let i?(Vi, V2) denote the 
set of edges having one endpoint in Vi and another endpoint in 1^. Given a vertex v, let T(v) 
denote the set of v's neighbors, and for W ^V, let Twiv) denote the set W n T{v). 

For an edge coloring x '■ E ^ C, let vr^ denote the corresponding partition of E into (at most) 
\C\ components. For two edge colorings x s-nd x' ^ we say that x' is a refinement of x if ^x' i^ ^ 
refinement of tt^, which is equivalent to saying that x'(^) = x'(^) always implies xi^) = x(^)- 

Observation 3.5 Let x (^nd x' be two edge-colorings of a graph G, such that x' is a refinement of 
X- For any path P in G, if P is a rainbow path under x, then P is a rainbow path under x' ■ In 
particular, if x makes G rainbow connected, then so does x' ■ ■ 
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We define a set of eight distinct colors C = {ai, 02, as, 04, 61, 62, 63, 64}. Given a coloring x '■ E ^ 
C we say that u,v V are a-rainbow connected if there is a rainbow path from n to u using only 
the colors 01,02,03, 04. We similarly define b-rainbow connected pairs. The following is a central 
lemma in the proof of Theorem 11.31 

Lemma 3.6 For any e > 0, there is N = A j^^]'g] (e) such that any connected graph G = {V, E) with 
n > N vertices and minimum degree at least en satisfies the following. There is a partition 11 of 

V into k < N components Vi,V2, ■ ■ ■ ,Vk, and a coloring x '■ E ^ C such that for every i G [k] and 
every u,v (zVi, the pair u,v is both a-rainbow connected and b-rainbow connected under x- 

Proof. Proof given in Section 13.41 ■ 

Using Lemma [3161 we derive the proof of Theorem 11.31 For a given e > 0, set iV = A [3^ e) and 
set C = |iV + 8. Clearly, any connected graph G = {V, E) with n < C vertices satisfies rc{G) < C. 
So we assume that n > C > N, and let IT = Vi, . . . ,Vk be the partition of V from Lemma 13. 6^ 
while we know that k < N. 

First observe that since the minimal degree of G is en, the diameter of G is bounded by 3/e. 
This can be verified by e.g. by taking an arbitrary vertex r V and executing a BPS algorithm 
from it. Let Li,... ,Lt be the layers of vertices in this execution, where Li are all vertices at 
distance i from r. Observe that since the minimal degree is at least en, the total number of vertices 
in every three consecutive layers must be at least en, thus t < 3/e. Since the same claim holds for 
any r G 1/, this implies that diam(G) < t < 3/e. 

Now let T = {Vt,Et) be a connected subtree of G on at most k ■ diam(G) < '^N vertices 
such that for every i (z [k], Vt nVi ^ 9. Such a subtree must exist in G since as observed earlier, 
diam(G) < 3/e. Let x ■ E ^ C he the coloring from Lemma [3T6l and let TC = {hi, /12, . . . , h^Er]} 
be a set of \Et\ < fresh colors. We refine x by recoloring every € E{T) with color hi G Ti. 

Let x' : E ^ U be the resulting coloring of G. The following lemma completes the proof of 
Theorem 11.31 

Lemma 3.7 The coloring x' makes G rainbow connected. Consequently, rc{G) < \Ex\ + 8 < C. 

Proof. Let u,v € V he any pair of G's vertices. If u and v reside in the same component Vi of the 
partition 11, then (by Lemma l3.6p they are connected by a path P of length at most four, which is 
a rainbow path under the the original coloring x- Since x' is a refinement of x, the path P remains 
a rainbow path under x' as well (see Observation 13. 5p . 

Otherwise, let u € Vi and v & Vj for i ^ j. Let ti and tj he vertices of the subtree T, residing in 
Vi and Vj respectively. By definition of x'j there is a rainbow path from ti to tj using colors from 
TC. Let Pt denote this path. In addition, by Lemma 13.61 we know that for the original coloring 
there is a rainbow path Pa from u to ti using colors oi, . . . , 04 and there is a rainbow path Pi, from 

V to tj using colors 61, . . . , 64. Based on the fact that x' is a refinement of x 5 it is now easy to verify 
that Pt, Pa and Pf, can be combined to form a rainbow path from n to f under x'- ■ 
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This concludes the proof of Theorem apart from the existence of a polynomial time algo- 
rithm for finding this coloring. We note that all arguments above apart from Lemma 13.61 admit 
polynomial algorithms for finding the corresponding structures. The algorithm for Lemma 13.61 will 
be given with its proof. 

3.4 Proof of Lemma 13.61 

First we state another auxiliary lemma, which is proved in the next section. 

Lemma 3.8 For every e > there exists N = A j^^^ (e) such that any graph G = iV, E) with 
n > N vertices and minimum degree at least en satisfies the following: There exists a partition 
U = Vi, . . . ,Vk of V such that for every i € [k] and every u,v G Vi, the number of edge disjoint 
paths of length at most four from u to v is larger than S^logn. Moreover, these sets can be found 
using a polynomial time algorithm for a fixed e. 

Proof, (of Lemma 13. 6p First we apply Lemma 13.81 to get the partition H. Now the proof follows 
by a simple probabilistic argument. Namely, we color every edge e G -E by choosing one of the 
colors in C = {ai, . . . , 04, 6i, . . . , 64} uniformly and independently at random. Observe that a fixed 
path P of length at most four is an a-rainbow path with probability at least 8~^. Similarly, P is 
a 6-rainbow path with probability at least 8~^. So any fixed pair u,v ^ Vi is not both a-rainbow- 
connected and 6-rainbow-connected with probability at most 2(1 — g-^^s-'iogn ^ ^-2^ ^^^^ therefore 
the probability that all such pairs are both a-rainbow connected and 6-rainbow connected is strictly 
positive. Hence the desired coloring must exist. 

To find the coloring algorithmically, we note that for every partial coloring of the edges of 
the graph it is easy to calculate the conditional probability that the fixed pair of vertices n, v is 
not both a-rainbow-connected and 6-rainbow-connected. Therefor we can calculate the conditional 
expectation of the number of pairs that are not so connected for any partial coloring. Now we 
can derandomize the random selection of the coloring above by using the conditional expectation 
method (cf . [2j ) : In every stage we color one of the remaining edges in a way that does not increase 
the conditional expectation of the number of unconnected pairs. Since this expectation is smaller 
than 1 in the beginning, in the end we will have less than 1 unconnected pair, and so all pairs will 
be connected. ■ 

3.5 Proof of Lemma ISTSl 

Given e > let L = A j3^ (e, 1) and set A'^ to be the smallest number that satisfies e^-^ > 8^ log A'^. 
Now, given any graph G = {V, E) with n > N vertices and minimum degree at least en, we apply 
Lemma 13.41 with parameters e and 1. Let IT = Vi,V2, ■ ■ ■ ,Vk be the partition of V obtained from 
Lemma [331 while as promised, k < L = A jg^ e). 

Fix i € [k] and u,v G Vi. From Lemma 13.41 we know that there is a component Va such that u 
has at least neighbors in Va- Similarly, there is a component Vh such that v has at least 
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neighbors in Vf,. Let r„ „ denote the set of u's neighbors in Va, and similarly, let denote v's 
neighbors in Vb- We assume in this proof that Va ^ Vb, and at the end it will be clear that the case 
Va = Vb can only benefit. 

We say that a set Wu = {wi, . . . ,wt} C Vi is distinctly reachable from u if there are distinct 
vertices w[, . . . ,10^ € Tu^a such that for every j £ [t], {wj,Wj} € E. Notice that the collection of 
pairs {wj^w'j} corresponds to a matching in the graph G, where all edges of the matching have one 
endpoint in Vi and the other endpoint in T^^a- Similarly, we say that ^ ^ is distinctly reachable 
from V if there are distinct vertices w'i^...,w[ E T^^b such that for every j € [t], {wj^w'^} G E. 
Observe that it is enough to prove that there exists a set W ^ Vi oi size e^-^ > S^logA^ which 
is distinctly reachable from both u and v. This will imply the existence of 8^ log N edge disjoint 
paths of length four from uto v. 

Our first goal is to bound from below the size of the maximal set as above. Since (by 
Lemma l3.4p Va and Vi are e^-regular pairs with density > ^ and since < e/3, the number of 
edges between Vu^a and Vi is at least ~ \^u,a \ • l^il- Before proceeding, we make the following 
useful observation. 

Observation 3.9 Let H = [A,B) he a bipartite graph with ^\A\\B\ edges. Then H contains a 
matching M of size 7 • 

Proof. Consider the following process that creates M. Initially Mq = 0. Then in step i, we pick an 
arbitrary edge {a, 6} G E(H), set Mj+i = Mi U {a, b} and remove from E{H) all the edges incident 
with either a or b. Clearly, in each step the number of removed edges is bounded by |^| + so 
the process continues for at least \a\+\b\ ~ 7 steps. Hence \M\ = \\^iMi\> 7 • ■ 

Returning to the proof of Lemma [3. 81 by Observation 13. 91 the size of a maximal set Wu as above 
is at least 



e 



ir..a||V-.| > _ (W(3fc))(n/fc) ^ ^ 
^ |r„al + iVil ~ V16 / en/{2,k)+n/k - Uk 



e — — — — — > e ^ > n. 

16 / |r„,a| + ~ V16 / en/{Zk)+n/k " 64A; 

To prove that W = Wu n Wa is large, we similarly use the regularity condition, but now on the 
pair (r„,6, W„). We get, 

\E{Tu,b,Wu)\ > \^v,b\\Wu\. 

Here too, by Observation 13.91 we can bound from below the size of a maximal matching in the pair 
(Tu^b,Wu) with 

where the last inequality follows from our choice of A^. Recall that the matching that we found 
defines the desired set W, concluding the proof. An algorithmic version of this lemma can be 
derived by simply using an algorithmic version of Lemma 13.41 in the selection oi Vi, . . . ,Vk above. 
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3.6 Graphs with diameter 2 

Proof of Theorem 11.41 Consider a random 3-coloring of E, where every edge is colored with 
one of three possible colors uniformly and independently at random. It is enough to prove that for 
all pairs u,v the probability that they are not rainbow connected is at most 1/n^. Then the 
proof follows by the union bound (cf. [2j). 

Let us fix a pair u,v V, and bound from above the probability that this pair is not rainbow 
connected. We know that both r(n) and T(v) (the neighborhoods of u and v) contain at least 
8 log n vertices. 

1. If {u, v} & E then we are done. 

2. If |r(n) nr(t;)| > 21ogn then there are at least 21ogn edge-disjoint paths of length two from 
u to V. In this case, the probability that none of these paths is a rainbow path is bounded by 
(1/3)^ < l/n?, and we are done. 

3. Otherwise, let A = T{u) \ T{v) and B = T{v) \ T{u). We know that \A\, \B\ > 61ogn, and in 
addition, since the first two cases do not hold and the diameter of G is two, all the (length 
two) shortest paths from A^s vertices to v go through the vertices in B. This implies that 
every vertex x (z A has a neighbor b{x) G B {b{x) need not be a one-one function). Let 
us consider the set of at least 61ogn edge-disjoint paths P = {u,x,b{x) : x G A}. For each 
X £ A, the probability that u, x, b{x),v is a rainbow path (given the color of the edge {b{x),v)) 
is 2/9. Moreover, this event is independent of the corresponding events for all other members 
of A, because this probability does not change even with full knowledge of the colors of all 
edges incident with v. Therefore, the probability that none of the paths in P extends to a 
rainbow path from n to is at most (7/9)^ '"S" < l/n?, as required. 

The above proof immediately implies a probabilistic polynomial expected time randomized 
algorithm with zero error probability (since we can also efficiently check if the coloring indeed makes 
G 3-rainbow connected). The algorithm can be derandomized and converted to a polynomial time 
probabilistic algorithm using the method of conditional expectations (cf. [2]) similarly to the proof 
of Lemma 13.61 For every partial coloring of the edges we can efficiently bound the conditional 
probability that a fixed pair u, v is not rainbow-connected, using the relevant one of the three 
cases concerning u and v that were analyzed above. Now we can color the edges one by one, at 
each time taking care not to increase the bound on the conditional expectation of unconnected 
pairs that results from the above probability bound for every u and v. Since the bound on the 
expectation was smaller than 1 before the beginning of the process, in the end we would get a valid 
3-rainbow-coloring of G. ■ 

4 Concluding remarks and open problems 

• Theorem 11.31 asserts that a connected graph with minimum degree at least en has bounded 
rainbow connection. However, the bound obtained is huge as it follows from the Regularity 
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Lemma. It would be interesting to find the "correct" bound. It is even possible that rc{G) < 
C/e for some absolute constant C. 

• The proof of Theorem 11.11 shows that deciding whether rc{G) = 2 is NP-Complete. Although 
this suffices to deduce that computing rc{G) is NP-Hard, we still do not have a proof that 
deciding whether rc{G) < k is NP-Complete for every fixed k. We can easily it for every even 
k by the following reduction from the case k = 2. Given a graph G, subdivide every edge 
into k/2 edges. Now, the new graph G' has rc{G') = k \i and only if rc[G) = 2. Indeed, if 
rc{G) = 2 then take a corresponding red-blue coloring of G and color G' by coloring every 
subdivided red edge of G with the colors 1, . . . , A;/2 and every subdivided blue edge with the 
colors k/2 + 1, . . . ,k. Conversely, if G' has an edge coloring making it rainbow connected using 
the colors 1, . . . ,k, then color each edge e of G as follows. If the subdivision of e contains the 
color 1, color e red; otherwise, color e blue. This red-blue coloring of G makes G rainbow 
connected. 

It is tempting to conjecture that for every k it is NP-Hard even to distinguish between 2- 
rainbow-colorable graphs and graphs that are not even A;-rainbow-colorable. 

• A parameter related to rainbow connection is the rainbow diameter. In this case we ask for 
an edge coloring so that for any two vertices, there is a rainbow shortest path connecting 
them. The rainbow diameter, denoted rd{G) is the smallest number of colors used in such 
a coloring. Clearly, rd{G) > rc{G) and obviously every connected graph with n vertices has 
rd{G) < (2) . Unlike rainbow connection, which is a monotone graph property (adding edges 
never increases the rainbow connection) this is not the case for the rainbow diameter (although 
we note that constructing an example that proves non-monotonicity is not straightforward). 
Clearly, computing rd{G) is NP-Hard since rc{G) = 2 if and only if rd[G) = 2. It would 
be interesting to prove a version of Theorem 11.31 for rainbow diameter. We conjecture that, 
indeed, if G is a connected graph with minimum degree at least en then it has a bounded 
rainbow diameter. 

• Suppose that we are given a graph G for which we are told that rc{G) = 2. Can we rainbow- 
color it in polynomial time with o(n) colors? For the usual coloring problem, this version has 
been well studied. It is known that if a graph is 3-colorable (in the usual sense), then there 
is a polynomial time algorithm that colors it with 0(n^/^^) colors [3]. 
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